Experiments in Character-Level Neural Network Models for Punctuation

نویسندگان

  • William Gale
  • Sarangarajan Parthasarathy
چکیده

We explore character-level neural network models for inferring punctuation from text-only input. Punctuation inference is treated as a sequence tagging problem where the input is a sequence of un-punctuated characters, and the output is a corresponding sequence of punctuation tags. We experiment with six architectures, all of which use a long short-term memory (LSTM) network for sequence modeling. They differ in the way the context and lookahead for a given character is derived: from simple character embedding and delayed output to enable lookahead, to complex convolutional neural networks (CNN) to capture context. We demonstrate that the accuracy of proposed character-level models are competitive with the accuracy of a state-of-the-art word-level Conditional Random Field (CRF) baseline with carefully crafted features.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adsorption of Fe (II) from Aqueous Phase by Chitosan: Application of Physical Models and Artificial Neural Network for Prediction of Breakthrough

Removal of Fe (II) from aqueous media was investigated using chitosan as the adsorbent in both batch and continuous systems. Batch experiments were carried out at initial concentration range of 10-50 mg/L and temperature range of 20–40˚C. In batch experiments, maximum adsorption capacity of 28.7 mg/g and removal efficiency of 93% were obtained. Adsorption equilibrium data were well-fitted with ...

متن کامل

Predictions of Tool Wear in Hard Turning of AISI4140 Steel through Artificial Neural Network, Fuzzy Logic and Regression Models

The tool wear is an unavoidable phenomenon when using coated carbide tools during hard turning of hardened steels. This   work focuses on the prediction of tool wear using regression analysis and artificial neural network (ANN).The work piece taken into consideration is AISI4140 steel hardened to 47 HRC. The models are developed from the results of experiments, which are carried out based on De...

متن کامل

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

متن کامل

A Bi-level Formulation for Centralized Resource Allocation DEA Models

In this paper, the common centralized DEA models are extended to the bi-level centralized resource allocation (CRA) models based on revenue efficiency. Based on the Karush–Kuhn–Tucker (KKT) conditions, the bi-level CRA model is reduced to a one-level mathematical program subject to complementarity constraints (MPCC). A recurrent neural network is developed for solving this one-level mathematica...

متن کامل

Handwritten Character Recognition using Modified Gradient Descent Technique of Neural Networks and Representation of Conjugate Descent for Training Patterns

The purpose of this study is to analyze the performance of Back propagation algorithm with changing training patterns and the second momentum term in feed forward neural networks. This analysis is conducted on 250 different words of three small letters from the English alphabet. These words are presented to two vertical segmentation programs which are designed in MATLAB and based on portions (1...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017